SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) Ap\lCANT: HOGREFE, Holly 
lANSEN, Connie J 

(ii) TITLE OFHNVENTION: Polymerase Enhancing Factor (PEF) 

Extracts, PEF Protein Complexes, Isolated PEF Proteins, 
and Methods tor Purifying and Identifying Them 

(iii) NUMBER OF SEQUENCES: 89 



(iv) CORRESPONDENCEVdDRESS: 

(A) ADDRESSEE: DavidS[. Kulik,^ 
Lenahan P.L.L.C. 

(B) STREET: 1200 G Street,^ Suite 700 

(C) CITY: Washington 

(D) STATE: DC 
(F) ZIP: 20005 



enson, McKeown, Edwards & 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: /Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-D( 

(D) SOFTWARE: Paientln Release #1.0, VersioK#1.25 

(vi) CURRENT APPLIckTION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 2U-OCT-1997 

(C) CLASSIFICATIOI 

(viii) ATTORNEY/AGENT E^ORMATION: 

(A) NAME: KULIK, DaviVj 

(B) REGISTRATION NUMBER: 36,576 

(C) REFERENCE/DOCKET NUMBER: 1486/43 163cp 



(ix) TELECOMMUNICATION INFOI 

(A) TELEPHONE: 202 628-8800 

(B) TELEFAX; 202 628-8844 



# # 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Xaa Leu His His Val Lys Leu lie Tyr Ala Thr Xaa Xaa Xaa 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Glu Xaa Leu Xaa Xaa 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Xaa Leu Leu His His Val Lys Leu lie Tyr Ala Thr Lys Xaa Arg Xaa 
15 10 15 

Leu Val Gly Lys Xaa lie Val Leu Ala lie Pro Gly Xaa Xaa Ala Xaa 
20 25 30 

Xaa Xaa Xaa 
35 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Glu Xaa Leu Xaa Glu Xaa 
15 10 15 

Xaa Xaa 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(D) -TOPOLOGY: unknown / 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Tyr Asp Ala Val lie Met Ala Ala Ala Val Val Asp Phe Arg Pro 
15 10 15 

Lys 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

ANTI-SENSE: NO 

FRAGMENT TYPE: internal 



(IV) 
(V) 



SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
5 10 15 

Asn Gin Val Val Leu He Gly Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) 

Ala 
1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Gly Ala Met Leu His His Val Lys Leu He Tyr Ala Xaa Lys Leu Arg 
1 5 10 15 



Lys 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminai 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 

Gly Ala Met Leu His His Val Lys Leu lie Tyr Ala Thr Lys Xaa Xaa 
15 10 15 

Arg Lys 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: cimino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Leu His His Val Lys Leu lie Tyr Ala Thr Lys Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 




(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Lys Phe Arg Lys Glu Glu Ser 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Ala lie Leu Leu Pro Asp Trp Lys lie Arg Lys Glu lie Leu lie 
1 5 10 15 

Glu 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Met His His Val lie Lys Leu Xaa Tyr Ala Thr Xaa Ser Arg Lys 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Leu Tyr Leu Val Arg Pro Asp Trp Lys Arg Arg Lys Glu lie Leu 
15 10 15 

He Glu 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CAYCAYGAHA ARYTHATTTA CGC 23 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 




(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCCATDATNA CDGCRTCGTA TTT 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CAYCAYGAHA ARYTHATATA CGC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ARDACDACYT GRTTTTCTTC 




(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

ATGCTTCACC ACGTCAAGCT AATCTACGCC ACAAAAAGTC GAAAGCTAGT TGGAAAAAAG 60 

ATAGTCNNNN NNNNNCCAGG GAGTATTGCG GCTTTGGATG TGAAAGCTTG TGAGGGACTA 120 

ATTAGGCATG GGGCCGAAGT TCATGCAGTG ATGAGTGAGG CAGCCACCAA GATAATTCAT 180 

CCTTATGCAT GGAATTTGCC CACGGGAAAT CCAGTCATAA CTGAGATCAC TGGATTTATC 240 

GAGCATGTTG AGTTAGCAGG GGAACATGAG AATAAAGCAG ATTTAATTTT GGTTTGTCCT 300 

GCCACTGCCA ACACAATTAG TAAGATTGCA TGTGGAATAG ATGATACTCC AGTAACTACA 360 

GTCGTGACCA CAGCATTTCC CCACATTCCA ATTATGATAG CCCCAGCAAT GCATGAGACA 420 

ATGTACAGGC ATCCCATAGT AAGGGAGAAC ATTGAAAGGT TAAAGAAGCT TGGCGTTGAG 480 

TTTATAGGAC CAAGAATTGA GGAGGGAAAG GCAAAAGTTG CAAGCATTGA TGAAATAGTT 540 

TACAGAGTTA TTAAAAAGCT CCACAAAAAA ACATTGGAAG GGAAGAGAGT CCTAGTAACG 600 

GCGGGAGCAA CAAGAGAGTA CATAGATCCA ATAAGATTCA TAACAAATGC CAGCAGTGGA 660 

AAAATGGGAG TAGCGTTGGC TGAAGAAGCA GATTTTAGAG GAGCTGTTAC CCTCATAAGA 720 

ACAAAGGGAA GTGTAAAGGC TTTTAGAATC AGAAAAATCA AATTGAAGGT TGAGACAGTG 780 

GAAGAAATGC TTTCAGCGAT TGAAAATGAG TTGAGGAGTA AAAAGTATGA CGTAGTTATT 840 

ATGGCAGCTG CTGTAAGCGA TTTTAGGCCA AAAATTAAAG CAGAGGGAAA AATTAAAAGC 900 

GGAAGATCAA TAACGATAGA GCTCGTTCCN NNNAATCCCA AAATCATTGA TAGAATAAAG 960 

GAAATTCAAC CAAATGTCTT TCTTGTTGGA TTTAAAGCAG AAACTTCAAA AGAAAAGCTT 1020 

ATAGAAGAAG GTAAAAGGCA GATTGAGAGG GCCAAGGCTG ACTTAGTCGT TGGTAACACA 1080 

TTGGAAGCCT TTGGAAGCGA GGAAAACCAA GTAGTATTAA TTGGCAGAGA TTTCACAAAA 1140 




GAACTTCCAA AAATGAAAAA GAGAGAGTTA GCAGAGAGAA TTTGGGATGA GATAGAGAAA 1200 

1209 

TTNCTGTCC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu His His val Lys Leu He Tyr Ala Thr Lys Ser Arg Lys Leu 
1 5 10 15 

Val Gly Lys Lys lie Val Xaa Xaa Xaa Pro Gly Ser He Ala Ala Leu 
20 25 30 

Asp Val Lys Ala Cys Glu Gly Leu He Arg His Gly Ala Glu Val His 
35 40 45 

Ala val Met Ser Glu Ala Ala Thr Lys He He His Pro Tyr Ala Trp 
50 55 60 

Asn Leu Pro Thr Gly Asn Pro Val He Thr Glu He Thr Gly Phe He 
65 70 75 80 

Glu His val Glu Leu Ala Gly Glu His Glu Asn Lys Ala Asp Leu He 
85 90 95 

Leu val cys Pro Ala Thr Ala Asn Thr He Ser Lys He Ala Cys Gly 
100 105 110 

He Asp Asp Thr Pro Val Thr Thr Val Val Thr Thr Ala Phe Pro His 
115 120 125 

He Pro lie Met He Ala Pro Ala Met His Glu Thr Met Tyr Arg His 
130 135 140 

Pro He val Arg Glu Asn He Glu Arg Leu Lys Lys Leu Gly Val Glu 
145 " 150 155 ISO 

Phe He Gly Pro Arg He Glu Glu Gly Arg Ala Lys Val Ala Ser He 



165 170 I'^S 

Tyr Arg Val lie Lys 
180 185 



Asp Glu He val Tyr Arg Val He Lys Lys Leu His Lys Lys Thr Leu 



Glu 



Gly Lys Arg Val Leu Val Thr Ala Gly Ala Thr Arg Glu Tyr He 



195 



200 



205 



Asp Pro lie Arg Phe lie Thr Asn Ala Ser Ser Gly Lys Met Gly Val 
210 215 220 

Ala Leu Ala Glu Glu Ala Asp Phe Arg Gly Ala Val Thr Leu lie Arg 
225 230 235 240 

Thr Lys Gly Ser Val Lys Ala Phe Arg lie Arg Lys lie Lys Leu Lys 
245 250 255 

Val Glu Thr Val Glu Glu Met Leu Ser Ala He Glu Asn Glu Leu Arg 
260 265 270 

Ser Lys Lys Tyr Asp Val Val He Met Ala Ala Ala Val Ser Asp Phe 
275 280 285 

Arg Pro Lys He Lys Ala Glu Gly Lys He Lys Ser Gly Arg Ser He 
290 295 300 

Thr He Glu Leu Val Pro Xaa Asn Pro Lys He He Asp Arg He Lys 
305 310 315 320 



■bis? 



Glu He Gin Pro Asn Val Phe Leu Val Gly Phe Lys Ala Glu Thr Ser 

325 330 335 

Lys Glu Lys Leu He Glu Glu Gly Lys Arg Gin He Glu Arg Ala Lys 

340 345 350 



Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
355 360 365 

Asn Gin Val Val Leu He Gly Arg Asp Phe Thr Lys Glu Leu Pro Lys 
370 375 380 

Met Lys Lys Arg Glu Leu Ala Glu Arg He Trp Asp Glu He Glu Lys 
385 390 395 400 



Xaa Leu Ser 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CATAGCGAAT TCGCAAAACC TTTCGCGGTA TGG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ACTACGGAAT TCCACGGAAA ATGCCGCTCA TCC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : u nknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGCGTTTCCG TTCTTCTTCG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) ~ LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 




(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
CCATCTCACG CGCCAGTTTC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GCTGGGAGAA GACTTCACTG G 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GAGCTTGCTC AACTTTATC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27 
GATAGAGATA GTTTCTGGAG ACG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

CGGGATATCG ACATTTCTGC ACC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CAGGACTCAG AAGCTGCTAT CGAA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
CTGCACGTGC CCTGTAGGAT TTGT 




(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
CCAGAYTGGA ARWKNAGGAA AGA 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
CCAGAYTGGA ARWKNAGAAA AGA 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CCAGAYTGGA ARWKNAGGAA GGA . 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : u nkno wn 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCAGAYTGGA ARWKNAGAAA GGA 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CAGAGTGGGC AGAGAGGCTN TTGTTAAGGG GAAATTAATC GACGTGGAAA 
AGGAAGGAAA 60 

AGTCGNTATT CCTCCAAGGG AATA 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Giu Trp Ala Glu Arg Leu Leu Leu Arg Giy Asn Xaa Ser Lys Trp Lys 
IS 10 IS 

Arg Lys Glu Lys Ser Xaa Phe Leu Gin Gly Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Arg Val Gly Arg Glu Ala Xaa Val Lys Gly Lys Leu He Glu Val Glu 
15 10 15 

Lys Glu Gly Lys Val Xaa He Pro Pro Arg Glu 
20 25 

(2) INFORMATION, FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) .tOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 




(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Gin Ser Gly Gin Arg Gly Xaa Cys Xaa Gly Glu He Asn Arg Ser Gly 
15 10 15 

Lys Gly Arg Lys Ser Arg Tyr Ser Ser Lys Gly Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTGCCCACTC TGAGGTCATA ACCTGCTGGT TGGAGCCATT CTTCAGAAAA TGGCTCTATA 60 
AGTATTTCTT TTCTGATTTT CCAGTCTGGA AGTAGCATTT TACCACCGAA ACCTTTATTT 120 
TTAATTTAA 129 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Xaa He Lys Asn Lys Gly Phe Gly Gly Lys Met Leu Leu Pro Asp Trp 
1 5 10 15 




Lys lie Arg Lys Glu He Leu He 
20 

Gin Pro Ala Gly Tyr Asp Leu Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 




Glu Pro Phe Ser Glu Glu Trp Leu 
25 30 

Val Gly 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

TCCTCCAAGG GAATACGCCT TAATCCTAAC CCTCGAGAGG ATAAAGTTGC CCGACGATGT 60 

TATGGGGGAT ATGAAGATAA GGAGCAGTTT AGCAAGAGAA GGGGTTATTG GTTCTTTTGC 120 

TTGGGTTGAC CCAGGATGGG ATGGAAACTT AACACTAATG CTCTACAATG CCTCAAATGA 180 

ACCTGTCGAA TTAAGATATG GAGAGAGATT TGTGCAGATC GCATTTATAA GGCTAGAGGG 240 

TCCGGCAAGA AACCCTTACA GAGGAAACTA TCAGGGGAGC ACAAGGTTAG CGTTTTCAAA 300 

GAGAAAGAAA CTCTAGCGTC TTTTCAATAG CATCCTCAAT ATCTCGTGTG AAGTAATCAA 360 

TGTAAATACT TGCTGGGTGG GTTTTTAGGG ATTCAAACTC GTAAGATGGG CCTGTATAGC 420 

AGAAAACTAT TTTTGCCTCT TCTTCATTTA TCTTTCTGTG AATAAAAAAT CCAACATCCA 480 

CACTAGTTCC AAAAGATATT GTTTGCGTGA TTACCAACAA GATCTTGGCA TTATTTTTGA 540 

TCTTATACTC TATTCTCCTT TCTCCCTCCA ATTTGCCCAA AATAAACCTG GGTAGTATAC 600 

ATTCACTCCT CTCTTTTAAA TTCCTATAAA TTCGTACATA GTTTAGAAAA ATGTCAAATT 660 

CTTTNTTCCC TGTTAAATTA ACCNCNAAAT CTTTATNANN AANCTTTTTA TAATTCCCAA 720 

AACCCCTAAT TTTCCCCTTN 740 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 



(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Leu Gin Gly Asn Thr Pro Xaa Ser Xaa Pro Ser Arg Gly Xaa Ser Cys 
15 10 15 

Pro Thr Met Leu Trp Gly lie Xaa Arg Xaa Gly Ala Val Xaa Gin Glu 
20 25 30 

Lys Gly Leu Leu Val Leu Leu Leu Gly Leu Thr Gin Asp Gly Met Glu 
35 40 45 

Thr Xaa His Xaa Cys Ser Thr Met Pro Gin Met Asn Leu Ser Asn Xaa 
50 55 60 

Asp Met Glu Arg Asp Leu Cys Arg Ser His Leu Xaa Gly Xaa Arg Val 
65 70 75 80 

Arg Gin Glu Thr Leu Thr Glu Glu Thr lie Arg Gly Ala Gin Gly Xaa 
85 90 95 

Arg Phe Gin Arg Glu Arg Asn Ser Ser Val Phe Ser lie Ala Ser Ser 
100 105 110 

He Ser Arg Val Lys Xaa Ser Met Xaa He Leu Ala Gly Trp Val Phe 
115 120 125 

Arg Asp Ser Asn Ser Xaa Asp Gly Pro Val Xaa Gin Lys Thr He Phe 
130 135 140 

Ala Ser Ser Ser Phe He Phe Leu Xaa He Lys Asn Pro Thr Ser Thr 
145 150 155 160 

Leu Val Pro Lys Asp He Val Cys Val He Thr Asn Lys He Leu Ala 
165 170 175 

Leu Phe Leu" He Leu Tyr Ser He Leu Leu Ser Pro Ser Asn Leu Pro 
180 185 190 

Lys He Asn Leu Gly Ser He His Ser Leu Leu Ser Phe Lys Phe Leu 
195 200 205 

Xaa He Arg Thr Xaa Phe Arg Lys Met Ser Asn Ser Xaa Phe Pro Val 
210 215 220 




Lys Leu Thr Xaa Lys Ser Leu Xaa Xaa Xaa Phe Leu Xaa Phe Pro Lys 
225 230 235 240 

Pro Leu lie Phe Pro Xaa 
245 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Pro Pro Arg Glu Tyr Ala Leu lie Leu Thr Leu Glu Arg lie Lys Leu 
15 10 15 

Pro Asn Asn Val Met Gly Asp Met Lys lie Arg Ser Ser Leu Ala Arg 
20 25 30 

Glu Gly Val lie Gly Ser Phe Ala Trp Val Asp Pro Gly Trp Asp Gly 
35 40 45 

Asn Leu Thr Leu Met Leu Tyr Asn Ala Ser Asn Glu Pro Val Glu Leu 
50 55 60 

Arg Tyr Gly Glu Arg Phe Val Gin lie Ala Phe lie Arg Leu Glu Gly 
65 70 75 80 

Pro Ala Arg Asn Pro Tyr Arg Gly Asn Tyr Gin Gly Ser Thr Arg Leu 
85 90 95 

Ala Phe Ser Lys Arg Lys Lys Leu Xaa Arg Leu Phe Asn Ser lie Leu 
100 105 110 

Asn He Ser Cys Glu Val He Asn Val Asn Thr Cys Trp Val Gly Phe 
115 120 125 

Xaa Gly Phe Lys Leu Val Arg Trp Ala Cys He Ala Glu Asn Tyr Phe 
130 _ 135 140 

Cys Leu Phe Phe He Tyr Leu Ser Val Asn Lys Lys Ser Asn He His 
145 150 155 160 

Thr Ser Ser Lys Arg Tyr Cys Leu Arg Asp Tyr Gin Gin Asp Leu Gly 
165 170 175 

He He Phe Asp Leu He Leu Tyr Ser Pro Phe Ser Leu Gin Phe Ala 
180 185 190 



Gin Asn Lys Pro Gly Xaa Tyr Thr Phe Thr Pro Leu Phe Xaa lie Pro 
195 200 205 

He Asn Ser Tyr He Val Xaa Lys Asn Val Lys Phe Phe Xaa Pro Cys 
210 215 220 

Xaa He Asn Xaa Xaa He Phe Xaa Xaa Xaa Leu Phe He He Pro Lys 

235 240 



225 



230 



Thr Pro Asn Phe Pro Leu 
245 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE; amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE; peptide 
(iii) HYPOTHETICAL: YES 



xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 



ser ser Lys Gly He Arg Leu Asn Pro Asn Pro Arg Glu Asp Lys Val 
15 10 15 

Ala Arg Arg Cys Tyr Gly Gly Tyr Glu Asp Lys Glu Gin Phe Ser Lys 
20 25 30 

Arg Arg Gly Tyr Trp Phe Phe Cys Leu Gly Xaa Pro Arg Met Gly Trp 
35 40 45 

Lys Leu Asn Thr Asn Ala Leu Gin Cys Leu Lys Xaa Thr Cys Arg He 
50 55 60 

Lys He Trp Arg Glu He Cys Ala Asp Arg He Tyr Lys Ala Arg Gly 
65 70 75 

ser Gly Lys Lys Pro Leu Gin Arg Lys Leu Ser Gly Glu His Lys Val 



85 



90 



ser val Phe Lys Glu Lys Glu Thr Leu Ala Ser Phe Gin Xaa His Pro 
100 105 110 

Gin Tyr Leu Val Xaa Ser Asn Gin Cys Lys Tyr Leu Leu Gly Gly Phe 
115 120 125 

Leu Gly He Gin Thr Arg Lys Met Gly Leu Tyr Ser Arg Lys Leu Phe 
130 135 140 



Leu Pro Leu Leu His Leu Ser Phe Cys Glu Xaa Lys He Gin His Pro 
145 150 155 



His Xaa Phe Gin Lys He Leu Phe Ala Xaa Leu Pro Thr Arg Ser Trp 
165 170 ^175 

His Tyr Phe Xaa Ser Tyr Thr Leu Phe Ser Phe Leu Pro Pro lie Cys 
180 185 190 

Pro Lys Xaa Thr Trp Val Val Tyr He His Ser Ser Leu Leu Asn Ser 
195 200 205 

Tyr Lys Phe Val His Ser Leu Glu Lys Cys Gin lie Leu Xaa Ser Leu 
210 215 220 

Leu Asn Xaa Pro Xaa Asn Leu Tyr Xaa Xaa Xaa Phe Tyr Asn Ser Gin 
225 230 235 240 

Asn Pro Xaa Phe Ser Pro 
245 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Leu His His Val Lys Leu He Tyr Ala Thr Lys Ser Arg Lys Leu 
15 10 15 

Val Gly Lys Lys He Val Xaa Xaa Xaa Pro Gly Ser He Ala Ala 
20 25 30 

(2) INFORMATION FOR SEQ ID.NO:47: 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Lys Tyr Asp Val Val He Met Ala Ala Ala Val Ser Asp Phe Arg Phe 
15 10 15 

Lys 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: cimino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
15 10 15 

Asn Gin Val Val Leu He Gly Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 




(iv) ANTI -SENSE: NO 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CTATTGAGTA CGAACGCCAT C 21 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GTCACGCTTG CTCCACTCCG 20 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanococcus Jannaschii 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met He Ser Glu He Met His Pro Thr Lys Leu Leu Lys Gly Thr Lys 
15 10 15 

Ser Lys Leu Leu Glu Asn Lys Lys He Leu Val Ala Val Thr Ser Ser 
* " 20 25 30 

He Ala Ala He Glu Thr Pro Lys Leu Met Arg Glu Leu He Arg His 
35 40 45 

Gly Ala Glu Val Tyr Cys He He Thr Glu Glu Thr Lys Lys He He 
50 55 60 



Gly Lys Glu Ala Leu Lys Phe Gly Cys Gly Asn Glu Val Tyr Glu Glu 



65 



70 



75 



80 



lie Thr Gly Xaa Xaa Xaa Xaa Xaa Asp lie Glu His lie Leu Leu Tyr 
85 90 95 

Xaa Xaa Xaa Xaa Asn Glu Cys Asp Cys Leu Leu lie Tyr Pro Ala Thr 
100 105 110 

Ala Asn lie lie Ser Lys lie Asn Leu Gly lie Ala Asp Asn lie Vai 
115 120 125 

Asn Thr Thr Ala Leu Met Phe Phe Gly Asn Lys Pro lie Phe lie Vai 
130 135 140 

Pro Ala Met His Glu Asn Met Phe Asn Xaa Xaa Ala lie Lys Arg His 
145 150 155 160 

He Asp Lys Leu Lys Glu Lys Asp Lys lie Tyr lie lie Ser Pro Lys 
165 170 175 

Phe Glu Giu Xaa Xaa Xaa Xaa Xaa Xaa Gly Lys Ala Lys Vai Ala Asn 
180 185 190 

He Glu Asp Vai Vai Lys Ala Vai lie Giu Lys lie Gly Asn Asn Leu 
195 200 205 

Lys Lys Giu Gly Asn Arg Vai Leu lie Leu Asn Gly Gly Thr Vai Giu 
210 215 220 

Phe lie Asp Lys Vai Arg Vai lie Ser Asn Leu Ser Ser Gly Lys Met 
225 230 235 240 

Gly Vai Ala Leu Ala Giu Ala Phe Cys Lys Giu Gly Phe Tyr Vai Giu 
245 250 255 

Vai lie Thr Ala Met Gly Leu Glu Pro Pro Tyr Tyr lie Lys Asn His 
260 265 .270 

Lys Vai Leu Thr Ala Lys Giu Met Leu Asn Lys Ala lie Giu Xaa Xaa 
275 280 285 

Leu Xaa Ala Lys Asp Phe Asp lie lie He Ser Ser Ala Ala lie Ser 
290 295 300 

Asp Phe Thr Vai Giu Ser Xaa Phe Giu Gly Lys Leu Ser Ser Giu Giu 
305 310 315 320 

Glu Xaa Xaa Xaa Xaa Leu lie Leu Lys Leu Lys Arg Xaa Asn Pro Lys. 

325 330 335 



Vai Leu Glu Giu Leu Arg Arg lie Tyr Lys Asp Xaa Lys Vai lie lie 
340 345 350 



Gly Phe Lys Aia Giu Tyr Asn Leu Asp Giu Lys Giu Leu lie Asn Arg 
355 360 365 



; 1 5 



Ala Lys Glu Arg Leu Asn Lys Tyr Asn Leu Asn Met lie He Ala A3n 
370 375 380 

Asp Leu Ser Lys Xaa Xaa His Tyr Phe Gly Asp Asp Tyr He Glu Val 
385 390 395 400 

Tyr He lie Thr Lys Tyr Glu Val Glu Lys He Ser Gly Ser Lys Lys 
405 410 415 

Xaa Glu He Ser Glu Arg He Val Glu Lys Val Lys Lys Leu Val Lys 
420 425 430 

Ser Xaa Xaa Xaa Xaa 
435 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Ala Arg Gin Gin Lys Tyr Cys Asp Lys He Ala Asn Phe Trp 

15 10 15 

Cys His Pro Thr Gly Lys He He Met Ser Leu Ala Gly Lys Lys He 
20 25 30 

Val Leu Gly Val Ser Gly Gly He Ala Ala Tyr Lys Thr Pro Glu Leu 
35 40 45 

Val Arg Arg Leu Arg Asp Arg Gly Ala Asp Val Arg Val Ala Met Thr 
50 55 60 

Glu Ala Ala Lys Ala Phe He Thr Pro Leu Ser Leu Gin Ala Val Ser 
65 70 75 80 

Gly Tyr Pro Val Ser Asp Ser Leu Leu Asp Pro Ala Ala Glu Ala Ala 

85 90 95 

Met Gly His He Glu Leu Gly Xaa Xaa Xaa Xaa Lys Trp Ala Asp Leu 
100 105 110 



Val He Leu Ala Pro Ala 



Thr 



Ala Asp 



Leu He Ala Arg Val Ala Ala 



115 120 125 

Gly Met Ala Asn Asp Leu Val Ser Thr He Cys Leu Ala Thr Pro Xaa 
130 135 140 

Xaa Ala Pro Val Ala Val Leu Pro Ala Met Asn Gin Gin Met Tyr Arg 
145 150 155 160 

Ala Ala Ala Thr Gin His Asn Leu Glu Val Leu Ala Xaa Ser Arg Gly 
165 170 175 

Leu Leu He Trp Gly Pro Asp Ser Gly Ser Gin Ala Cys Gly Asp He 
180 185 190 

Gly Pro Gly Arg Xaa Xaa Asp Pro Leu Thr He Val Asp Met Ala Val 
195 200 205 

Ala His Phe Ser Pro Val Asn Asp Leu Lys His Leu Asn He Met He 
210 215 220 

Thr Ala Gly Pro Thr Arg Glu Pro Leu Asp Pro Val Arg Tyr He Ser 
225 230 235 240 

Asn His Ser Ser Gly Lys Met Gly Phe Ala He Ala Ala Ala Ala Ala 
245 250 255 

Arg Arg Gly Ala Asn Val Thr Leu Val Ser Gly Pro Val Ser Leu Pro 
260 265 270 

Thr Pro Pro Phe Val Lys Arg Val Asp Val Met Thr Ala Leu Glu Met 
275 280 285 

Glu Ala Ala Val Asn Xaa Xaa Ala Ser Val Gin Gin Gin Asn He Phe 
290 295 300 

He Gly Cys Ala Ala Val Ala Asp Tyr Arg Ala Ala Thr Val Ala Pro 
305 310 315 320 

Glu Lys He Lys Lys Gin Ala Thr Gin Gly Asp Glu Leu Thr He Lys 
325 330 335 

Met Val Lys Xaa Asn Pro Asp He Val Ala Gly Val Ala Ala Leu Lys 
340 345 350 

Asp His Arg Pro Tyr Val Val Gly Phe Ala Ala Glu Thr Asn Asn Xaa 
355 360 365 

Xaa Xaa 'Xaa Val Glu Glu Tyr Ala Arg Gin Lys Arg He Arg Lys Asn 
370 375 380 

Leu Asp Leu He Cys Ala Asn Asp Val Ser Gin Pro Thr Gin Gly Phe 
385 390 395 400 

Asn ser Asp Asn Asn Ala Leu His Leu Phe Trp Gin Asp Gly Asp Lys 
405 410 415 



Val Leu Pro Leu Glu Arg Lys Giu Leu Leu Gly Gin Leu Leu Leu Asp 
420 425 430 



Glu He Val Thr Arg Tyr Asp Glu Lys Asn Arg Arg 
435 440 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO . 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Ala Gly Trp lie Asp Ala Gly Phe Lys Gly Lys lie Thr Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Pro Thr He Val Asp Ala Gly Phe Glu Gly Gin Leu Thr He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 58: 



# # 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Ala His Arg lie Asp Pro Gly Trp Ser Gly Cys lie Val Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
( 0 ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



CAGGACTCAG AAGCTGCTAT CGAA 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 

CTGCACGTGC CCTGTAGGAT TTGT 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CTATTGAGTA CGAACGCCAT C 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



GTCACGCTTG CTCCACTCCG 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 
GCTGGGAGAA GACTTCACTG G 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) - TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

His His Val Lys Leu lie Tyr Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Lys Tyr Asp Ala Val He Met Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Glu Glu Asn Gin Val Val Leu 

1 ' ' 5 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



Pro Asp Trp Lys lie Arg Lys Glu 
1 5 



"(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ATGCTACTTC CAGACTGGAA AATCAGAAAA GAAATACTTA TAGAGCCATT TTCTGAAGAA 60 

TCGCTCCAAC CAGCAGGTTA TGACCTCAGA GTGGGCAGAG AGGCTTTTGT TAAGGGGAAA 120 

TTAATCGACG TGGAAAAGGA AGGAAAAGTC GTTATTCCTC CAAGGGAATA CGCCTTAATC 180 

CTAACCCTCG AGAGGATAAA GTTGCCCGAC GATGTTATGG GGGATATGAA GATAAGGAGC 240 

AGTTTAGCAA GAGAAGGGGT TATTGGTTCT TTTGCTTGGG TTGACCCAGG ATGGGATGGA 300 

AACTTAACAC TAATGCTCTA CAATGCCTCA AATGAACCTG TCGAATTAAG ATATGGAGAG 360 

AGATTTGTGC AGATCGCATT TATAAGGCTA GAGGGTCCGG CAAGAAACCC TTACAGAGGA 420 

AACTATCAGG GG AGCACAAG GTTAGCGTTT TCAAAGAGAA AGAAACTCTA G 471 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : u nknown 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Leu Leu Pro Asp Trp Lys He Arg Lys Glu He Leu He Glu Pro 
15 10 15 

Phe Ser Glu Glu Ser Leu Gin Pro Ala Gly Tyr Asp Leu Arg Val Gly 
20 25 30 

Arg Glu Ala Phe Val Lys Gly Lys Leu lie Asp Val Glu Lys Glu Gly 
35 40 45 

Lys val val He Pro Pro Arg Glu Tyr Ala Leu He Leu Thr Leu Glu 
50 55 60 

Arg He Lys Leu Pro Asp Asp Val Met Gly Asp Met Lys He Arg Ser 
65 70 75 

ser Leu Ala Arg Glu Gly val He Gly Ser Phe Ala Trp Val Asp Pro 



85 



90 



Gly Trp Asp Gly Asn Leu Thr Leu Met Leu Tyr Asn Ala Ser Asn Glu 
100 105 110 

val Glu Leu Arg Tyr Gly Glu Arg Phe Val Gin He Ala Phe He 



Pro 

115 



120 125 



Arg Leu Glu Gly Pro Ala Arg Asn Pro Tyr Arg Gly Asn Tyr Gin Gly 

^ ^ ^ — 1 An 

130 



135 140 



ser Thr Arg Leu Ala Phe Ser Lys Arg Lys Lys Leu 
145 



150 155 



2) INFORMATION FOR SEQ ID NO: 72: 



(2) 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: cunino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 




Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
15 10 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ala Gly Trp He Asp Ala Gly Phe Lys Gly Lys He Thr Leu 
15 10 

(2) information' FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 





(ii) 


MOLECULE TYPE: peptide 




(iii) 


HYPOTHETICAL: NO 




(iv) 


ANT I -SENSE: NO 


\ ll 


(V) 


FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Pro Thr He Val Asp Ala Gly Phe Glu Gly Gin Leu Thr He 

Q 15 10 

Q (2) INFORMATION FOR SEQ ID NO: 77: 

Q (i) SEQUENCE CHARACTERISTICS: 

n (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Ala His Arg He Asp Pro Gly Trp Ser Gly Cys He Val Leu 
15 10 




(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Vai Gly Leu He Asp Ser Asp Tyr Gin Gly Gin Leu Met He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
{ D ) TOPOLCX5Y : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Ala Gly Val Val Asp Arg Asp Tyr Thr Gly Glu Val Lys Val 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 



(LV) ANTI -SENSE: NO. 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ala Gly Val He Asp Glu Asp Tyr Arg Gly Asn Val Gly Val 

15 10 ■ 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Thr Gly Leu He Asp Pro Gly Phe Gin Gly Glu Leu Lys Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
GACGACGACA AGATGCTACT TCCAGACTGG AAA 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
GGAACAAGAC CCGTCCCACT TTCACAGATG AAGAG 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 

CTCCATGTCC CAACTCCGAT CAC 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 



• 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
GGTTTTCCCA GTCACGACGT TGTAAAACGA CGGCCAGT 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GGUUUUCCCA GUCACGACGU UGUAAAACGA CGGCCAGU 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI_-&ENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 
GACGACGACA AGATGCCCTG CTCTGAAGAG ACACC 



(2) INFORMATION FOR SEQ ID NO: 89: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) T0P0LCX5Y : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89 



GGAACAAGAC CCGTTTAATT CTTTCCAGTG GAACC 



